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1. Introduction 

The usefulness of the canonical ensemble in statistical mechanics is remarkable. The 
standard explanation for this success relies on taking the thermodynamical limit which 
corresponds to increasing the volume of the system to infinity while keeping all the 
relevant intensive quantities, i.e. densities, fixed and finite. From this point of view, the 
canonical ensemble should not have much utility for small systems consisting of only a 
few particles. This is not completely true and canonical expectation values can be used 
also for small systems although not very accurately in the direct manner used in 
thermodynamics. 

We shall try to generalize the ideas leading to the standard thermodynamics so 
that these methods could be applied in the analysis of small systems. We shall first 
concentrate on finding practical approximations which capture the large scale behaviour 
of an equilibrium system. For this purpose, we present in sections ^ and | a coarse 
graining procedure which justifies a modified Gaussian ensemble and we derive a positive 
saddle point (PSP) approximation which can be applied for further simplifying the 
Gaussian results. 

As an application of these methods, we present in section ^ how the PSP- 
approximation leads to a generalization of thermodynamics for essentially isolated 
systems. As a second application, we derive the recently proposed non-extensive Tsallis 
statistics as a PSP-approximation. We conclude on a discussion of implications to 
Tsallis thermodynamics and to the maximum entropy principle in the last two sections, 
I and |. 

2. Preliminaries in quantum statistics 

The standard approach to quantum statistics @] is defined by using a density matrix 
p, which is a non-negative, hermitian, trace-class operator normalized to one and which 
gives the expectation value of an observable A by the formula {A) = Tr(y4p). In some 
complete eigenbasis {ipi), the density matrix can thus be expanded as 



where the eigenvalues Pi are non-negative, Pi > 0, and they satisfy the normalization 
condition J2iPi = 1- 

Suppose next that the system has a discrete energy spectrum with finite 
degeneracies, which in quantum mechanics is achieved for every sufficiently binding 
potential. An equilibrium, i.e. time-independent, ensemble is then given by a density 
matrix which has time-independent eigenvalues and which satisfies the Liouville-von 
Neumann equation [H,p\ = 0. In this case, the eigenvectors ipi can be chosen so that 
they are also energy eigenvectors; let Ei denote the eigenvalue corresponding to ipi- 

Assume also that energy is the only relevant conserved quantity in this equilibrium 
system. By this we mean that the probability of finding any energy eigenstate depends 




(1) 
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only on energy and not on any other conserved quantum numbers, i.e. we require that 
Pi = Pj for every pair of indices with Ei = Ej. Then we can find a function / such that 
p = f{H), which is just another notation for demanding that pi = f{Ei) in (|l|). It is 
also obvious that in this case / can be chosen as a smooth, positive function such that 
JdE f[E) < oo. Then we can define a smooth probability density F by the formula 
F{E) = f{E)/ J f using which the density operator becomes 

P^^. (2) 

TtF{H) 

We shall call any smooth probability density F which satisfies (@) a fluctuation 
spectrum of the system. If we require that the energy spectrum is bounded from below, 
we can normalize the Hamiltonian so that the lowest eigenvalue is strictly positive and 
then choose a fluctuation spectrum with a support on the positive real axis. 

Finally, let us define a few mathematical tools needed in the following discussion. 
We shall denote the Gaussian probability density with mean and standard deviation 
A by Gx, i.e. 

Gx{x) = ^==exp 



For a pointwise application of inverse Fourier-transforms, we shall also need the class S of 
Schwarz test functions, also called rapidly decreasing functions. They are infinitely many 
times differentiable, i.e. smooth, functions which have the property that the function 
and any of its derivatives vanish faster than any power at infinity — the distribution Gx 
is a prime example of a rapidly decreasing function. For a more precise definition see 
e.g. chapter 6 of the book on functional analysis by Rudin [Q. 



3. An overview of the results for standard ensembles 

The purpose of this section is to give a brief review of the main results in the special case 
when they lead to the usual canonical and Gaussian ensembles. We have chosen this 
peculiar order of presentation, so that there would be a concrete application which can 
be kept in mind when going through the more abstract results in the following section. 
It also serves as an introduction to the methods we apply in the following section. We 
do not imply that these results are obvious — we shall go through their derivations in 
detail in section 

Let F be a fluctuation spectrum of an equilibrium system for which energy is the 
only relevant conserved quantity and let A be a positive observable. Assume also that 

Al. The system is canonical: its spectrum is bounded from below and Tre"^''^ < oo for 
all p>0. 

A2. The energy fluctuations decay faster than exponentially at high energies: e^^F{E) 
is a rapidly decreasing function for all real (3. 

A3. The observable is canonical: TT{Ae~^^) < oo for all f3 > 0. 
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We shall call a fluctuation spectrum satisfying the condition [A2| precanonical and we 
shall use the name precanonical system for those systems which are canonical and have 
precanonical fluctuations. Note also that by M we can choose F so that its support is 
bounded from below and thus |A2| is a restriction for the behaviour of the fluctuations 
only near E 



+00. 

The assumptions AT-pO| allow for all /3 > the integral representation 
Tx(aF{H) 



— F(u;)Tr( Ae" 



wH 



(3) 



where F is the Laplace transform of F and the integrand in the above equation is an 
analytic function in the half-plane Hew > 0. We can use saddle point methods for 
finding a better integration contour and approximations to the integral. In fact, usually 
there exists a unique positive saddle point Pa which will be the best choice for /5 in (|^) 
as the integration contour will then go through this saddle point via the path of steepest 
descent. 

The saddle point approximation around Pa will then yield "the best possible 
canonical approximation" of the trace. This saddle point approximation can be 
expressed in terms of gA{P) = {AH) /{A) and a\{p) = {A{H - gAf)/{A), where all 
expectation values are taken in the canonical ensemble at inverse temperature /3, and 
in terms of the following two characteristics of the fluctuation spectrum, 

a{P)f 



a{P) 



J dxF{x)e^^x 



and b'{p) 



J dxF{x)e^^[x 
J dxF{x)e 



(4) 



/ dxF(x)e^^ 
The saddle point equation is then 

aiP) = gA{P), 

and there is unique a positive solution. Pa, to this equation provided 

a(0) < lim^QAiP). 
When this is the case, the saddle point approximation around Pa yields 

Tr(AF{H)^ ^ [27r(62 + ^1)]'^ ^-PA+^b^Pl Tr^le"^-^^) , 

where all functions are to be evaluated at P = Pa- 

Suppose now that the positive saddle point approximation is good for the partition 
function, i.e. for A = 1, and let Pq be the corresponding saddle point value. Assume 
also that A is an observable which does not alter the canonical distribution much, 
g^(p) = {AH)i3/{A)fs ^ {H)f} and a\ ^ {H'^) - {H)^. Then we can use Po also for the 
saddle point approximation of Tt[AF{H)] and arrive at the usual canonical formula 



(5) 



(6) 



Tr 



AF{H) 



Tr 



TrF{H) Tre-^oH 

For large systems, we are typically only interested in "total" observables like total 
energy and total particle number. These observables depend only on the behaviour of 
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the system at the scale of its total energy and, therefore, we would expect to need only 
information about the behaviour of the system coarse grained up to a scale A which is 
less than the total energy but larger than the microscopic energy scales. 

For precanonical systems, it is possible to define a Gaussian approximation which 
will yield accurate results for most large scale|| canonical observables: if we smooth the 
fluctuation spectrum by taking a convolution with a Gaussian distribution with variance 
A, then after defining L = a(0) and = &^(0), we get in the limit A —>■ oo. 



Tr 



A{F*GA)iH) 



Tr 



AiGx*GA)iL-H) (l + 0(A-3)). 



(7) 



This choice of parameters L and is natural since they are the expectation value and 
variance of the probability distribution F, as can be seen from @. They are, in fact, 
also the best possible, since in theorem |l| we shall prove that any other choice of L and 
A^ guarantees only a slower convergence in 1/A. 

The precise conditions for an application of the theorem are discussed in section 
and one set of sufficient conditions is given by 



lim 



MP) 



oo, 



0^ 



(ii) PgAiP) and [3a\{[3) / qa^P) stay bounded in the limit [3 

It is relatively easy to check that these conditions hold, for example, for all energy 
moments {A = H^) when the system consists of particles in a harmonic potential. In 
addition, by (|]), condition (D also guarantees the existence of a positive saddle point 
solution and thus of a canonical approximation. 

If these two conditions hold also for A = 1, we can then see that the Gaussian 
ensemble yields an accurate approximation to the original ensemble 



Tr 



AF{H) 



Tr 



AGx{L-H) 



TtF{H) TrGxiL-H) 
at least for all sufficiently large scale observables A. 



4. The positive saddle point (PSP) approximation 

In the preceding section we anticipated the derivation of the canonical ensemble from 
a saddle point approximation under very general assumptions. Is there any need for 
non-canonical ensembles? In principle, the canonical approximation can fail in several 
different ways as the following examples illustrate. 

(i) The density of states increases faster than exponentially or is not discrete and thus 
the canonical ensemble is ill-defined. 

(ii) F decays only e.g. polynomially in energy. Then the above arguments relying on 
analyticity will not hold and the Gaussian approximation might not be applicable. 

I These can be defined as observables whose expectation value changes less than some preset limit 
when the fluctuation spectrum is coarse grained. 
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(iii) The addition of the observable A alters the canonical energy distribution too 
radically and either we cannot use [3a = Po or, even worse, Tr(y4e~^'^) = oo. 

(iv) Canonical ensemble gives too rough an approximation for the real statistics. This 
will always eventually happen when observables resolving ever smaller energy scales 
are inspected, unless the original fluctuation spectrum is exactly canonical. 

Let us now first concentrate on solving the "exotic" part of the problem, the 
exponential increase of density of states (cases | and |ii|) and the possibility of non- 
exponentially decaying fluctuations (case A solution in both cases is to reparametrize 
the energy so that in the new variables the asymptotic high-energy behaviour is 
sufficiently regular. 

Let g{E) be the function which performs this reparametrization, i.e. let g is be an 
orientation preserving diffeomorphism from the real line onto itself. Since we do not 
want to restrict the speed of the growth of the density of states in any way, we do not 
assume anything more of the diffeomorphism at this stage, only that it regularizes the 
high-energy behaviour of both the density of states and the fluctuation spectrum: we 
assume that in the reparametrized variables the density of states (with the effect of the 
observable included) does not increase exponentially and that the fluctuation spectrum 
decreases at least exponentially. 

Expressed mathematically, we assume that there exists real numbers /?_ and /3+ 
such that < < /5+ and 

Bl. For all /?>/?_, A is a positive (^-bounded observable: TT{Ae~^^^^^) < oo. 

B2. The reparametrized fluctuations decrease at least exponentially at high energies: 
e^^F{g^^{x)) is a rapidly decreasing function for all /3 < 

We also assume that these parameters have been chosen in the best possible manner, 

(3- = inf {(3 I Tr(le-^^(-^)) < oo}, (8) 
/3+ = sup{/?|e^^F(^?-i(x))G5}, (9) 

and thus either of them might be infinite. Note also that f3- depends on the choice of 
the observable A. 

It looks like we lost some generality at this stage. However, by adding a suitable 
part of the energy dependence to the observable, i.e. by replacing A by A^{H) where 
$(i?) is a suitable function, it is usually possible to find such a reparametrization g that 
P- and P+ satisfy the requirement /3_ < < /?+. For example, if the density of states 
increases like e^'^ choosing $(i?) = e~^'^ would allow still using g = id. 

Suppose now that we would like to approximate the behaviour of the system by 
using the reparametrized Gaussian ensemble, p = Gx{L — g{H)), or the reparametrized 
canonical ensemble, p = e~^^^^\ It would be natural to assume that the best 
approximation of this kind follows if we choose the parameters L and A as the 
reparametrized expectation value and variance of F and f3 from the saddle point 
approximation of the trace in the partition function. We will now concentrate on 
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deriving conditions under which this intuition is vahd and on finding estimates for 
the errors induced by the approximations. 

4-1. Canonical approximation of a trace 

It follows easily from assumption that the function 

/>oo 

F{w) = / dxe""'F(^-i(x)) (10) 



is well-defined and analytic in the region Hew < Since g was a diffeomorfism and 
we can typically choose F with a support on the positive real axis, we can usually also 
define this "gf-transform" of F by 



F{w) = / dEg\E)e3^^^'"F{E). 
Jo 



When g{E) = E this is just the Laplace transform and if g{E) = Ini? it coincides with 
the Mellin transform. 

As the inverse Fourier-transformation can be taken pointwise for rapidly decreasing 
functions, we have pointwise for all /3 < 

F(E) = I ^^P3iE)-P9iE)p^f^ _ ^ 1^^'" ^F{w)e-^3^''\ (11) 
J 27r J/3-ioo 27ri 



By applying the representation (0) we get then for any observable satisfying Bl an 
integral representation valid for (3- </?</?+, 

Tt{AF{H))= —F{w)TT{Ae-'"3^^^). (12) 

i/3-ioo 2771 

The boundedness of the trace also implies that the function Tr(74e~'"^^^'') is analytic in 
the half plane Hew > (3- and, therefore, the integrand in (|12D is an analytic function in 
the strip /3_ < Rew < /3+ and we proceed to consider its evaluation by the method of 
steepest descent. 

For writing down the saddle point approximation it will be useful to define 

iii/.f'ir/ i/ iif— /, ^ I I I I I ■ I 1 1 i/ iii— 1/ III III I ^ ^ 

a{w) 



fdxF(g-\x))e'"''x , , fdxF(g-^(x))e'"''\x-a(w)y 

j<lxF{g-^{x))e"'' jdxF{g~^{x))e"'^ 



g{w) = — — — and a'^{w) = — 



Tr(^Ae-"'9(^)j Trj^Ae-^'fW 

which are well-defined in the whole strip apart from the countable set of zeros of the 
denominators. 

In terms of these quantities, the saddle point equation becomes 

a{w)-g{w) = Q, (14) 
and the second derivative of the logarithm of the integrand is given by 

h''{w) + a^{w). (15) 
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For real values of w, both 6^ and cr^ are strictly positive and thus the second derivative 
becomes a strictly positive quantity on the real part of the strip. Therefore, the first 
derivative defining the saddle point equation is a strictly increasing function on the 
interval /?_ < w < /?+ and there exists a real saddle point on this interval if and only if 

a(/5_) <^(/3_) and a{(3+) > -g{(3+). (16) 

If the real saddle point exists, it is obviously unique and choosing [3 in (^) equal to 
this saddle point value will lead to an integration contour which goes through the real 
saddle point via the steepest descent path. 

Assume then that the real saddle point exists, denote it by (3 a-, and parametrize 
the integration variable in ( |T2l) as w = (3a + ia- If the integrand, as a function of 
a, is strongly concentrated in such a neighbourhood of the origin that the quadratic 
approximation of the logarithm of the integrand is admissible, then the saddle point 
approximation yields 

MAF^m) - ^ =F{PA)MAe-^^''% 

271 [13 a) + (y^ [Pa)) 

Note that this can happen, since the maximum of the absolute value of the integrand 
is at a = 0, although it cannot be guaranteed without further assumptions. For a 
derivation of bounds for the accuracy of the saddle point approximation, we shall need 
to make a detour via a Gaussian approximation. 

4-2. Gaussian approximation of a trace 

Suppose we want to coarse grain the fluctuation spectrum like in the previous section 
and that way deduce what kind of Gaussian approximation would be most accurate for 
large scale observables. The use of a Gaussian convolution is not justified now since 
we do not know if the resulting function will give a trace-class operator. Moreover, the 
transformed function might not satisfy the decay condition |B^ any more and we could 
not use the above integral representation for it. 

There is, however, a natural generalization of the usual coarse graining procedure 
which will give functions satisfying condition B^: we shall perform the coarse graining 



2A- 



-(g{E)-g(x)f 



by a convolution with a Gaussian distribution in the reparametrized space, 

F^[E)= [dyF[g~\y))GME)-y)= [dxF[x)g'[x)-l=i 
J J V27rA2 

It is obvious that this transformation satisfies the following semigroup-property, 

and that the (^-transform changes simply by multiplication, 
Fa[w) =e^^''"'F[w). 
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As the Gaussian distribution is a rapidly decreasing function, its convolution 
leaves the class S invariant. Therefore, if el^^F{g~^{x)) is rapidly decreasing, so is 
e^^ Fi^{g~^{x)), since clearly 



e^^F^{g-\x)) = e^^"^" I dye^yF{g~\y))G^{x - f3A' - y). 



This proves that if F satisfies condition for some /?+ then so does Fa. 
By the above results, we now have an integral representation 

Ti{AFk{H))= — e2^'"''F(w;)Tr(le-'"»(^)), (17) 

i/3-ioo 27ri 

which is valid for all 13- < (3 < (3+ and A > 0. We would now like to make a quadratic 
approximation of F{w) which would then lead to a Gaussian trace. In fact, this is often 
possible in the limit of large A, but since the discussion gets a bit technical at this point, 
we leave the proof of this to Appendix A and state only the results here. 



Define the functions a, 6^, g and by equations (p^. Then a(0) and 6^(0) are the 
expectation value and the variance of the reparametrized fluctuation spectrum and we 
proceed to show that these values yield the best possible approximation of a Gaussian 
form. If /5_ = 0, we also need to assume that the following three conditions are satisfied: 

Gl. a(0) < ^(0) 

G2. There exists a constant c > for which 

/5^^^'^(/5) stays bounded in the limit (3 0+. (18) 
G3. (3a'^{(3)/g{(3) stays bounded in the limit (3 0+. 

With these notations, the following theorem, whose proof we have included in 



Appendix A| , holds: 



Theorem 1 Let L = a(0) and = 6^(0) and assume that BJ_ and B2 are satisfied for 
some P- < < /3+. = and \GJ\^G^ hold, then in the large scale limit A — > oo for 

all c>0 satisfying (j7^, 

Tr(lFA(if)) = F(0) Tr [1g^^(L - giH))] (l + O(A-Tfc)) , (19) 

and if (3- < 0, then ( ff^ j is true for c = 0. 

In addition, for any other choice of L or only a slower convergence in this limit 
can he guaranteed. 

Since G^^r^^^^ = (Ga)a, we can thus conclude that, when the quality of the 
approximation is measured by expectation values of large scale observables, the best 
approximation of the Gaussian form for F{H) is given by 

F{H)^F{0)Gx{L-g{H)). (20) 

Therefore, we have proved that having either /5_ < or conditions |G 1HG3| satisfied will 



lead to the intuitive approximation mentioned earlier in section O 



The standard situation we examined in section |] implies that /5_ = 0, /?+ = cxd 
and g{x) = x. Thus we then need to satisfy the latter three conditions. In fact, it is 
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straightforward to see that the conditions (|) and ([]) given in section |^ do imply that 
Gl| is satisfied for any a(0), that we can choose c = in condition |G2|, and that p3| 



holds. Theorem ^ will then justify equation (0). 



4-3. Accuracy of the canonical approximation 



The canonical approximation was derived in section |4.1| from a real saddle point 
approximation of the trace. Estimating the accuracy of this approximation is, however, 
quite difficult. We shall in this section present estimates for the accuracy of the saddle 
point approximation in the special case when the fiuctuation spectrum is Gaussian. By 
the results of the previous section, this can then be used also for more general fiuctuation 
spectra, if the observable is of sufficiently large scale and the conditions for the use of 
the Gaussian approximation are valid. 

Assume thus that the fiuctuation spectrum is F{H) = G\{L — H), when P+ = oo, 
and let /3 be some value greater than We shall later see that the best bounds are 
obtained if P is the solution to the saddle point equation 

pX' + L-m = 0. (21) 

However, since we shall also need estimates for how far from the saddle point value (3 
can be chosen before the saddle point approximation loses its accuracy, we shall not fix 
the value of P yet. 

The bounds will be expressed in terms of the ratio R = \/a{[5). Since the 
derivations of the bounds are again quite technical, we postpone them into [Appendix B 



and present only the results here. The real saddle point approximation to the Gaussian 
trace is there shown to yield 

Tt[aG,{L - g{H))\ ^ e^^-^¥''G^^^^,^^[L + - m) Tr [Ac'^^^^)] , (22) 

and we call the right hand side "saddle point approximation" and denote it by "PSPA" 
even when (3 does not satisfy the saddle point equation (^Tj). 

The first bound for the accuracy of this approximation is obtained by using Jensen's 
inequality very analogously to what was done in . This shows that for any (3 the ratio 
of the Gaussian trace and its PSP-approximation is bounded by 



exp 



2(A2 + (t2(/5)) 



1 



^ Tv[AG,{L-g{H))] ^ / , 1 
PSPA 





r 1 1 


)%xp 


[ 2rA 



(23) 



These inequalities prove that, if (3 is close to the saddle point value and ^ 1, the 
relative error from the PSPA becomes negligible. 

The second bound for the relative error is given for ApgpA in 

TT[AG,{L-g{H))] 

PSPA -1 + ^PSPA 
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and it depends on the behaviour of + ia) near a = 0. Since cr^(/9) 7^ 0, we can for 
all < r < 1 find a > such that 



< r, for all — Pr < « < Pr 



(24) 



The bound is then expressed in terms of r and 



exp 



2(A2 + (t2(/3)) 
3 r 

< 



|A 



PSPAl 



1 



+ e 2 



1+1 



1 



(25) 



21 + i?n i + i?2. 

Clearly, the bound is informative only if A is so large that Ap^ > 1- 

If P is so close to the saddle point value that (L + PX'^ — g{(3))^ <^ cr^(/^) + and 
if r can be chosen so that |ApgpA| is small, then the PSP-approximation of a Gaussian 
trace is very accurate. By applying the first bound (P^D, we can see that this will happen 
after a coarse graining with a large enough A, if /5 can be chosen so that A ^ cr(/5). 
Unfortunately, unlike was erroneously claimed in section 4 in the last condition can 
not be satisfied in a typical thermodynamical limit and the more complicated second 
bound has to be applied in these cases. 



4-4- Expectation values 

So far we have inspected the approximation of one trace only. The statistical expectation 
values are ratios of two traces and as some of the terms cancel in the ratio, we can write 
down simpler results for the expectation values. To avoid confusion, we shall use the 
subscript A to denote the quantities related to the trace in the numerator (i.e. depending 
on the observable) and the subscript for the quantities related to the denominator, 
which is independent of the observable. 

We also take into account the possibility of having the behaviour of a part of the 
original fluctuation spectrum predetermined: If $(x) describes the known behaviour, 
we use a new fluctuation spectrum determined by /(x) which satisfies F{x) = /(x)$(x) 
and the observable = ^{H)A instead of the original fluctuation spectrum F{x) and 
observable A. Such a separation is sometimes practical as we shall see in section |^, or 
even necessary as we noted in the beginning of section |^. 

We shall assume in the following that the Gaussian approximation is valid for the 
observable A, when for parameters L = a(0) and A^ = &^(0), 



Tr 



{A) 



<i>AG4L-g{H) 



Tr 



'^Gx{L-g{H) 



(26) 



We shall then apply the PSP-approximation to the Gaussian traces. The saddle 
points Po and Pa are determined by the equations 



PoX^ + L - goiPo) = and PaX^ + L - uipA] 







(27) 
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and, in general, they are not equivalent. We estimated in the preceding section the 
accuracy of the PSP-approximation for the Gaussian traces and the bounds derived 
there can be used for determining if these approximations are valid. When this is the 
case, we get from ( p2D and ( [27| ) 

/ X2 , „2(o ^\\ Tr $Ae'3A(90-9{H)) 



A2 + ai(/5, 



Tr 



$e/3o(9o-ff(H)) 



(2^ 



where = 90%) = Tr ^g{H)e 



/Tr 



This result can be simplified for those observables, for which it is possible to use 
/3a = /5o- By the discussion in the previous section, this is possible at least when 
(^o(/3o) —9a{.Po)Y ^ + cr^(/5o), i.e. the addition of the observable does not change the 
value of g significantly. If also |o"^(/3o) — o'o(/5o)| <^ + (To(/5o), the expectation value 
can be approximated by the usual canonical formula, 



{A) 



Tr 



Tr (^e-Pog{H) 



(29) 



5. Essentially microcanonical systems and PSP-entropy 

Most applications of statistical methods to physical systems consider a large number of 
particles constrained into a fixed region of space, either by having them in a container 
(gases and liquids) with (potential) walls which prevent the particles from escaping, or 
by an attractive interaction between the particles (solids). In both cases, the interactions 
with the environment happen via the boundary of the container and it is usually 
plausible to assume that in an equilibrium the total energy fluctuations are negligible. 

This motivates the use of the microcanonical ensemble, where energy fluctuations 
are neglected entirely and the density operator is proportional to 5{H — E). However, 
in real systems, there are interactions with the environment, and although the energy 
fluctuations are small, they are not non-existent. One of the motivations for our 
inspection of the Gaussian ensembles was to develop methods for examination of the 
effect of these fluctuations. 

We want now to inspect how and when the fluctuations can be neglected. First, we 
shall assume that the fluctuation spectrum has a compact support, i.e. it is zero outside 
some finite interval, and we shall denote its mean and variance by 



E = j dxF(x)x and ^ j ~ 



(30) 



We shall also assume that the reparametrization g is essentially linear on the support of 
F, i.e. that we can use the approximation g{y) g{E) + [y — E)g'{E) when computing 
F. This leads to the approximations 

m^g'iE), a{0)^g{E) and b\0) ^ g'{Efe' 
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which become exact in the hmit e — > 0. Note also that since the support of F is compact, 
we have now /3+ = oo and the parameter range for (3 is thus given by /3 > 

Thus we can define the parameters of the generahzed Gaussian ensemble by 
L = g{E) and = g'{E)'^e'^, and the saddle point which yields the best canonical 
approximation becomes 

px'+L- m = 0. 

We would like to put A ^ in this equation, and consider only solutions to the simpler 
equation 

g{E)=m- (31) 

We have seen that this kind of change to the saddle point value is possible provided 
the new value satisfies (L + /3A^ — g{P))^ <^ A^ + cr^(/5), and thus the solution of 
is a good approximation if, for example, P^X^ -C Pcr{P). We assume now that e is so 
small that this approximation can be made and we shall call such systems essentially 
microcanonical. 

A straightforward application of the Lebesgue dominated convergence theorem 
reveals that Iimj3^oog{l3) = giEo), where Eq is the lowest energy for which the 
expectation value of the observable A is non-vanishing. For example, ii A = 1 then 
£"0 is the ground state energy. Since g is continuous and monotonely decreasing, we can 
then deduce that a necessary and sufficient condition for equation ( |3TD to have solutions 
P > P- is given by g{Eo) < L < g{P-) or, if expressed in terms of energy, 

E^<E< g-\g{P-))- 

The microcanonical entropy S^c is defined as the logarithm of the density of states. 
One possible quantitative definition for the density of states at energy E at the scale e, 
would be given by the number of states in the energy interval [E — \e,E + |e] divided by 
the length of the interval, e. Then the microcanonical entropy S^c{E, e) would be equal 
to InTrF(iJ), where F is the fiuctuation spectrum proportional to the characteristic 
function of the interval [E — ^e^E + . We shall generalize this a bit and inspect the 
approximation of In Tr F{H) for a general fiuctuation spectrum F. The Gaussian (pO|) 
and saddle point approximations (|2^ ) yield the estimates 



InTrF(^) ^In-pEL + lnTrexpf-^^^ — yP^) (32) 
v27rA^ V 2A / 

^ -lln(27r)-iln^l±^ + /5^(/?) + lnTre-^^(^), (33) 

where in the second formula we have used /3 defined by equation (|3T|) for the choice 
A = 1. We have also not included the possible prefactors $ which where introduced in 
section [4.4| — these would unnecessarily complicate the formulae and their inclusion only 
amounts to the replacements Tr Tr $. 
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In (|33|), the first two terms are logarithmic corrections depending on the scale of 
the fluctuations and they are usually dominated by the remaining terms. For reasons 
becoming apparent soon, we shall call the remaining terms the PSP-entropy, 



Tr g{H)e-f^3{H) 

Comparing this to the definition of the canonical Gibbs entropy, S'can, shows then that 
S'psp(/3; H) = ScaniP', g{H)), i.e. this is the same as the Gibbs entropy if the Hamiltonian 
H is replaced by the reparametrized energy operator g{H). 

One consequence of this identification is that the maximum entropy principle holds 
also for S'psp in the following form 

Theorem 2 (Maximum PSP-entropy) Suppose g{H) is a self-adjoint operator and 

E is a real parameter such that equation has a solution f3 > (3_. Then there is a 
unique positive operator p which maximizes the Gibbs entropy functional 

S[p\= Tr(-plnp) 
under the restrictions 

Trp = 1 and g{E) = TT{pg{H)) 
and this p has the canonical form 

^ ~ Tr e-f^9iH) ■ 

This theorem follows by an application of the standard result for the canonical ensemble 
when the self-adjoint operator H is replaced by g{H); for a proof of the standard case, 
see e.g. section 4.2.2 of 0. 

Define then (3{E) as the function which maps an energy in the allowed interval 
Eq < E < g^^{g{i3_)) to the solution of equation (|3T|) . An apphcation of the imphcit 
function theorem then shows that (3 is differentiable and that 

Since by equation (Q), d5'psp(/3)/d/3 = — /3(T^(/?), we can conclude that 

= M{E) (35) 

where we abuse the notation in the usual way and denote S^sp{E) = Spsp{P{E)). One 
immediate conclusion from this equation is that if /3_ < 0, then the inverse E{S) is 
unique only if the two regions, E < g^^{g{0)) when /? > and E > g~^{g{0)) when 
P < 0, are treated separately. 
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5.1. Choosing effective Hamiltonians by the maximum entropy principle 

We shall now show that it is possible to use the maximization of S'psp for choosing 
parameters in effective Hamiltonians. This method is a simple extension of the standard 
one — for an application of the standard method see e.g. chapter 9.3. of the book by 
Bahan g. 

The easiest way of applying the maximum entropy principle is through an effective 
potential f/, which in the standard case = id is called free energy and denoted by F. 
Define U by 

U{T;H) = -TlnTre-T^'(^), (36) 

for all T G M for which the trace convergences. A differentiation of this equation shows 
then that 

and thus we get the following corollary to theorem ^, 

Corollary 3 Let H be the Hamiltonian, let g be an energy reparametrization and let E 
be a parameter such that equation has a solution P{E) > (3- and define T = l/j3{E). 
If h is a self adjoint operator for which 

Tr {g{H)e'^9^^^ 

yi^; = 

then 



g{E) = (37) 



U'{T;h)>U'{T;H) (38) 
and the minimum is attained only for h = H . 

Note that in ( ^7\) the expectation value is of the original Hamiltonian. 

The theorem is usually used in the following manner: Suppose that the true 
Hamiltonian H is too complicated for the computation of the trace in U{T;H), but 
there is a physically plausible effective Hamiltonian H^s depending on some parameters 
{xi} for which the function U{T, {xi}; Hes) and its T-derivative U'{T, {xi}; Hes) can 
be computed. Assume also that it is possible to compute and invert equation (|37|) for 
one of the variables. By inserting the solution to U'{T, {xi}; Hcr) and minimizing the 
result with respect to the remaining variables, we get the best possible — in the sense of 
entropy — parameters for the effective Hamiltonian. 

As a curiosity, let us point out that U(T) can also be obtained from a Legendre 
transform of the reparametrized energy. When /5_ = 0, the inverse of S'psp (£^) is unique 
and if we denote it by -E'psp(S'), it is easy to check that U{T) defined by 

U{T) = mf s{g{E^sAS))-TS} 

is equal to (^). If < 0, the inverse is double valued, and the Legendre transformation 
of the first branch will define U{T) for T > and transformation of the other branch 
for T < 0. 
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6. The axioms of Tsallis statistics 

In the following sections we shall discuss a non-standard application of the PSP- 
approximation: we shall inspect the connection between the so called Tsallis statistics 
and a PSP-approximation with a certain non-trivial energy reparametrization. For this 
comparison, we shall present a few main properties of the Tsallis statistics in this section. 

In 1988, C Tsallis proposed a possible generalization of the principles leading 
to the Boltzmann statistics. Motivated by the usefulness of entropy functionals 
different from the Gibbs one in the analysis of fractal phenomena, he proposed a two- 
parameter generalization of the Gibbs entropy and expectation values of observables and 
then derived a generalization of the Boltzmann statistics using the maximum entropy 
principle. In the following years, the new results where applied to many interesting 
non-extensive phenomena, such as stellar polytropes 0, anomalous diffusion P, ^ and 
electron plasma columns p, ^ . Especially the results from a maximization of a Tsallis 
entropy functional for the electron plasma column were promising, since this gave results 
more consistent with experimental observations than a maximization of a similar Gibbs 
entropy. 

In these applications the principles were set up in the form of axioms, and the 
following are cited from 

Axiom 1 (Escort Probabilities) The system is described by W microscopic state 
probabilities Pi > 0, which can be used to define a density matrix p. These "escort 
probabilities" satisfy Yl^iPi = Trp = 1. 

Axiom 2 (q-Entropy) The entropy of the system is defined in terms of two real 
parameters k and q, 

rfl -I Trd'i -I 
SM = k ^'=:'^' = k^^^^. (39) 
1 — g 1 — Q' 

Axiom 3 (q-Expectation Value) The expectation value of an observable A which 
has an expectation value in the state number i is given by the formula 

w 

{A), = Y,p1a, = TT{Ap'^). 

1=1 

With these axioms a maximization of Sq while holding q and the g-expectation 



value of the Hamiltonian fixed will yield |T0|| the canonical Tsallis ensemble 



Pi = ^ max(0, 1 - (1 - g)/3,£,)i/(i-«), (40) 

where Zg enforces the normalization Y2Pi ~ ^ ^^"^ f^q results from using a Lagrange 
multiplier technique in the maximization. It is evident from this formula that in the 
limit g — i> 1 the result reduces to the standard canonical ensemble. 



These axioms are, however, slightly controversial. As has been noted before [11 



in this axiomatic version we do not have a proper normalization of the expectation 
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values, since typically {l)q ^ 1 ii q ^ \. Here we propose that the easiest solution, the 
so called third choice presented in |T^, is the correct solution. In section |^ we present 
some possible conditions under which the canonical Tsallis ensemble would offer a better 
description of the system than the canonical Boltzmann ensemble. However, for this we 
find it necessary that Axiom ^ is replaced by 

Axiom 3' (q-Expectation Value) The expectation value of an observable A which 
has an expectation value ai in the state number i is given by the formula 



^ Tr 



EW q 



We shall discuss the use and meaning of these axioms in more detail in section ^ but 
first we need to show how the canonical Tsallis ensemble can be obtained from a PSP- 
approximation. 



7. Tsallis statistics from a PSP-approximation 



For the definition of the reparametrization which leads to the Tsallis statistics we 
need a new energy parameter, Em- This parameter denotes the maximum energy the 
system can have and we shall take this into account explicitly by using a prefactor 
$(£') = 6{Em — E) in the traces. Physically, E^ could represent an energy beyond 
which the system would evaporate or, if we are using effective Hamiltonians, beyond 
which the effective description is no longer valid. For now, let Em simply be a parameter 
which bounds the allowed energy range from above. 

Naturally, it is necessary to have Em greater that the ground state energy Eq for 
this kind of restriction to make any sense. To simplify the following discussion we now 
assume that the Hamiltonian has been normalized so that the ground state energy is 
equal to zero; this can always be accomplished by replacing H hj H — Eq. Then we have 
Em > and we can define the reparametrization by g{E) = — In (1 — E / Em)- Applying 
this to equation shows that the canonical approximation is then given by 



{A) 



If we now define q and [3q by 

/5 



Tr 


A^{H){1 


- H/EmY 


Tr 




- H/EmY 



and Pg 



(1 - q)E„ 



(41) 



(42) 



then the right hand side of (^) becomes equal to the canonical Tsallis expectation 
values as defined by (^OD and Axiom 0. This alone would justify the expression for 
the Tsallis expectation values as an approximation to the original ensemble. It also 
gives a connection, via the saddle point equation, between the parameters of the Tsallis 
ensemble, q and and the characteristics of the original ensemble, Em, L and A^. 
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It does not, however, tell us when this approximation would be better than the usual 
canonical one. We shall come back to this in the following section. 

In order to get some feeling how the saddle point approximation behaves in this 
case, we shall now assume that the density of states increases polynomially and in the 
computation of the traces we shall apply the approximation 

/■oo 

Tr/(#)^c/ dxx"-VM, (43) 



Jo 

where uj defines the relevant energy scale for the increase of the density of states and c 
is a constant independent of /. This approximation is not quite as arbitrary as it looks, 
since it can be derived e.g. from the high-energy behaviour of a system of n harmonic 
oscillators. For energy observables A = equation ( ^3] ) yields 

Tr[M'e-^^(^^] ^ ciEjcoTEt l^dyy-^'-^l - yf oc < r(ntfc + ^+l) - ^^^^ 
Clearly, for all > we now have /?_ = — 1 and thus we need not worry about the 
applicability of the Gaussian approximation for large scale observables. 

We could now examine the accuracy of the canonical approximation to the Gaussian 



traces by using the bounds derived in section Going through these computations, 
however, is quite tedious and probably not very interesting, so we shall give a few 
numerical examples instead. 

By the approximation (0), the canonical expectation values are now 

(^^-^^ilTTlTr- (45) 

j=0 ^ + n+j 

where the parameter f3 is determined from the saddle point equation involving the 
digamma function ip^x) = ^lnr(x), 

L + pX^ = tP{n + p + l)-ij{p + l). (46) 

For instance, the expectation value and the variance of energy are then 

"-Ir-nV - ^ ^<^>^ <^^' 

and the percentage of fluctuations of the total energy in the canonical ensemble is, 
therefore, approximately a/(1 — p)/n. If we had used the Boltzmann ensemble here, the 
total energy fluctuations would have been given by l/y/n, and thus they would have 
been larger than in the Tsallis case by the factor of — p. 

We have given a few examples of what kind of errors the coarse graining and the 
Gaussian, PSPA and canonical approximations induce to an initially microcanonical 
fluctuation spectrum in table |1|. Of course, the Gaussian approximation is closest to 
the microcanonical case when A <^ 1 and the closer E/Em is to 1, the larger values of 
A can be used before the energy expectation value changes signiflcantly. The canonical 
Tsallis approximation has the expected behaviour, but the accuracy of the PSPA-result 
is surprisingly good, especially at reproducing the energy fluctuations of the Gaussian 
ensemble. We also want to point out that (3 can have negative values here even though 
this makes the density operator apparently singular. 
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Table 1. The effect of coarse graining and the computation of expectation values 
in the different approximations: the first row gives the expectation value of energy, 
(H)/Em, and the second row the percentage of energy fluctuations, Yai{HY/'^ / {H) . 
The three expectation values refer to equations (Eq), (ESh and respectively. 



Parameters Expectation values 



n 




A 


/3 


Pa 




1 


G\ 


PSPA 


Tsallis 


r 



O.i 


0.U5 


lb. 6 


on n 


0.65 


r\ Old 
0.96 


r\ 1 c no 
0.15U3 


n 1 C AT 

O.loUi^ 


A 1 C /I O 

0.1545 










33.3 






0.2345 


0.2329 


0.4049 


5 


0.5 


0.5 


1.81 


2.05 


0.93 


0.64 


0.6398 


0.6422 


0.6403 










2.26 






0.1912 


0.1885 


0.2525 


5 


0.99 


0.1 


-0.66 


-0.63 


0.03 


-1.90 


0.9899 


0.9843 


0.9355 










-0.61 






0.0010 


0.0336 


0.1043 


5 


0.99 


5.0 


-0.09 


-0.08 


3.84 


-0.10 


0.8450 


0.8450 


0.8456 










-0.08 






0.1603 


0.1603 


0.1625 


100 


0.1 


0.05 


155.7 


156.5 


1.00 


0.99 


0.3891 


0.3891 


0.3895 










157.3 






0.0554 


0.0554 


0.0780 


100 


0.5 


0.05 


71.4 


72.0 


0.56 


0.99 


0.5806 


0.5806 


0.5801 










72.5 






0.0316 


0.0316 


0.0646 


100 


0.99 


0.1 


0.47 


0.48 


0.10 


0.32 


0.9900 


0.9901 


0.9855 










0.49 






0.0010 


0.0008 


0.0120 



8. Towards Tsallis thermodynamics 

Consider the following experiment: take a macroscopic piece of material in thermal 
equilibrium and separate a small sample from it. Let the sample be contained in some 
finite volume by a finite potential and wait until it again reaches an equilibrium. Now 
the sample will have lost energy and possibly particles by evaporation from the surface, 
especially there will be no particles with enough kinetic energy to escape the potential 
barrier in the sample. If the original temperature was high enough, the average energy 
density would also decrease and one would expect the system to end up in a state with 
average energy per particle near the escape potential and with a sharp drop for higher 
energies. If the number of particles in the final sample is small enough, the broad 
high-energy tail indicated by the canonical ensemble will give an observable change to 
the predicted behaviour of the system. The example in section |^ shows that under 
these conditions the Tsallis statistics with a suitable choice of Em can offer a better 
alternative. 

There are also instances when the canonical ensemble is ill-defined, but we can 
still resort to the Tsallis ensemble with its finite energy cut-off. If the density of states 
increases faster than exponentially, the canonical ensemble cannot be used, whereas the 
Tsallis ensemble will always produce finite expectation values. Another, more surprising, 
example is given by systems with a "Coulomb-like" spectrum: these systems have a 
point spectrum only below certain value E^. while the spectrum above E^. is continuous 
or, what is relevant to our case, does not contain any eigenvalues. For these systems 
any trace-class operator of the form F{H) must have a cut-off at the energy Ec — thus 
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canonical ensemble will not work, but Tsallis ensemble with Em < E^ is well-defined. 

We now wish to examine what happens if the fluctuation spectrum F is essentially 
microcanonical, i.e. when the total energy fluctuations are small enough that the 
reparametrization g is essentially linear on the support of F and we can use the saddle 



point values from the approximate equation ([31|). Let E denote the expectation value 
of the distribution F, when necessarily E < Em, and let P{E) be the solution to the 
approximate saddle point equation, 



ln(l - E/Er, 



Tr 


ln(l - 


H/Em)e{Em-H){l-H/EmY 


Tr 


e{Em~H){l-H/EmY 





(48) 



The Tsallis parameters q and fig are then obtained from (3 and Em as in the previous 
section, by equation (^2|) . We shall also adopt the notation 



1 - (1 - '-''e{l-{l-q)P,H), 

when the escort matrix defined in section ^ is given by p = cD/ Tr u). 

Let us then assume that the canonical approximation is accurate for the energy 

observable: E ~ Tr(ifp'')/Trp''. In the examples in table |I| this is the case whenever 

the energy E lies near the cutoff Em- Using the definition (|49|) , we can then deduce that 

E TiQ , , 

1 - — = ^^r^. 50 

Em Tr^;-? ^ ' 

We showed in section ^that the PSP-entropy is an approximation to the logarithm 
of the density of states. In W . Applying the definition of /5 and of the Tsallis parameters 
then yields the formula 



(49) 



In ly ^ ^, 



psp 



\-q 

where in the last equality we have used 

1 / Tr^;« 



ln(l -E/E„) +lnTr^;5 



1 



In 



\-q 

. Thus the g-entropy satisfies for p 
1 



(irco'j'^J 

ujj Tr cD, 

(52) 



\ — q\ (Tr ujY ) ^ — 1 
This relation between the density of states and the Tsallis-entropy was one of the 
motivations used in the original article for the definition of Sq. 

The other results proven in section |] apply here as well, especially we can conclude 
that the energy dependence on changes of /3 while holding Em fixed satisfies 

^lEm d 



d(3 



:E. 



(53) 



1 - E/Em d(3 

Interestingly, the Tsallis parameters yield a similar equation if we fix q and let jSq vary. 
For this, define Pq{E, q) as the solution to the equation 

Tr(^cD«) 



E 



Then a straightforward but lengthy computation shows that 



d ln(l + (1 - q)Sq 
df5q l-q 



d 



q)PqEdPq 



E 
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which by (|5l| ) and (^2]) is essentially the same equation as in (^). 

These results clarify the difference between the use of the Tsallis parameters and 
the PSP-parameters. In the first case, the increase of the fluctuation spectrum, defined 
by g, is a known quantity, while the boundary conditions determine the position of the 
energy cut-off, defined by (3q. In the PSP- approximation, it is more natural to vary 
the energy of the system while holding the energy cut-off fixed. Otherwise, these two 
methods are very similar, as both have a simple formula for the change of energy and 
an maximum entropy principle which can be applied for a determination of effective 
Hamiltonians, as explained in section ^ 

In real experiments performed on a sample of matter in a container, one usually 
examines the properties of the sample matter by altering some easily controllable 
physical conditions of the environment. For example, if the temperature of the container 
is the varying quantity, then a natural parameter for the experiment would be the 
energy density of the sample matter at the boundary. Unfortunately, unless the state 
of the system is homogeneous, the energy density on the boundary is not determined 
trivially from the total energy given by the Hamiltonian. It is, of course, possible that 
the boundary conditions can be included naturally to the dynamics by using effective 
Hamiltonians and this would then determine the relations between the parameters of 
the experiment and the parameters of the PSP-approximation. 

There has already been a lot of discussion of the physical meaning of the Tsallis 
parameter q in the literature. For example, for systems exhibiting a multifractal 
behaviour the value of q could be quite straightforwardly determined as was shown 
|TB|. The value of q can be also related to the degree of non-extensivity of energy and 



m 



the speed of growth of certain classical long-range potentials [14 



9. Speculations about the maximum entropy principle 

As mentioned earlier, the Tsallis ensemble is usually derived by postulating the Tsallis 
entropy functional (p9D and then finding its maximum under the condition that the 
energy expectation value is fixed. This approach is motivated by the standard derivation 
of the canonical ensemble by a similar maximization of the Gibbs entropy functional. 

The motivation behind these derivations lies in the intuitive association of the 
maximum likelihood ensemble with the least restricted, maximum disorder, ensemble. 
Since for systems with infinitely many degrees of freedom the natural choice of 
associating the maximum likelihood with equiprobability is not possible, it is then 
necessary to postulate an entropy functional which quantitatively measures the disorder 
of any density operator. The Gibbs entropy functional 5'[p] = Tr(— p In p) can be derived 
by requiring the entropy to be additive for independent systems. The additivity of 
the thermodynamical entropy, on the other hand, is closely related to the extensivity 
of thermodynamical systems — see, for example, the axiomatic derivation of Lieb and 
Yngvason which clearly highlights the role of extensivity. Since the Tsallis entropy 



is only sub- or superadditive P], by this reasoning it would be useful only for systems 
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which are non-extensive. 

In this work we have adopted a different implementation of the maximum disorder 
principle: by coarse graining of energy we try to find a simple parametrization of the 
underlying ensemble which would have all the same relevant predetermined properties as 
the original ensemble. The coarse graining will naturally increase the number of energy 
states which participate in the ensemble and thus we are, in a way, always increasing 
the microcanonical entropy of the ensemble. 

We now claim that in spite of the similarities, the present approach is conceptually 
easier and more readily applicable to a wider variety of situations. First, the possible 
fluctuations of the parameters can also be properly taken into account in this formulation 
and the effect of these fluctuations can be analysed. Secondly, although the energy 
reparametrization g, which is responsible for the appearance of non-standard ensembles, 
plays the same role as the different entropy functionals in the maximum entropy 
derivations, its interpretation is more tangible and, consequently, choosing between 
different non-standard ensembles should be easier. 

We would also like to emphasize the role of entropy differently here. Instead of 
a fundamental, philosophical role the entropy has been endowed, we would like to 
emphasize its practical importance both as a function encoding the dependence of energy 
on the temperature-parameter and as an effective potential which enables one to choose 
parameters in effective descriptions of the system. 

As we have shown, our definition of PSP-entropy, (0), leads to an extension of 
both of these properties. The relation between the changes of total energy and PSP- 
entropy are given in (P5|), from which the standard case follows by setting g = id. The 
maximization of the statistical PSP-entropy was proven in theorem ^ and its corollary 
provides a way of choosing between effective Hamiltonians. The Tsallis axioms also offer 
another, slightly different, way of applying the maximization of entropy which leads to 
one of the canonical PSP-ensembles. 

10. Conclusions 

We have presented a generalized coarse graining procedure which can be used for 
analyzing a wide variety of quantum systems. The main purpose of the coarse graining 
was to develop simple parametrizations which could be used for analyzing the behaviour 
of the system when its energy is varied. We restricted to the energy only, but the method 
can be easily extended to the case when there are many relevant conserved quantities. 

One important application was to systems for which the Boltzmann ensemble either 
does not converge, or produces too large total energy fluctuations. We have shown 
that in this case the Tsallis statistics offers a generalization which is more likely to 
work, but which still retains some of the useful properties of classical thermodynamics: 
maximization of an entropy functional and a simple formula for the differential change 
of energy. The cost in this case is the inclusion of an extra parameter which needs to be 
chosen in a suitable way dependent on the behaviour of the system — the traces involved 
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in the computations are also likely to be prohibitively difficult to compute. 

In the final section we have raised some objections to the use of the maximum 
entropy principle as a fundamental justification of the canonical Boltzmann ensemble. 
Especially for purposes of physics teaching, it has the disadvantage that it either leads to 
the idea that canonical ensemble is the only sensible statistical description of an arbitrary 
physical system or that thermodynamics can only be applied in the thermodynamical 
limit in which all the standard ensembles agree. We consider both of these ideas to be 
misguidingly restrictive. 

The general guidelines for choosing an ensemble could be the following: the number 
of parameters should be as few as possible and have as direct as possible relation to the 
physical characteristics of the system. In addition, the traces needed for the evaluation 
of the ensemble should be computable for the given Hamiltonian and, naturally, the 
ensemble must reproduce the original ensemble within the required accuracy. Of the 
coarse grained ensembles we considered, it should always be possible to find a Gaussian 
ensemble which is accurate enough. The Gaussian expectation values, however, are 
usually difficult to evaluate, while the corresponding canonical ensembles are often easier 
in this respect. Of the canonical ensembles, the Boltzmann ensemble is the simplest but, 
when it cannot be used, the canonical Tsallis ensemble is a good second trial. 

There is, however, one more possibility we have not stressed yet. From table 
|I] we can see that the PSP- approximation of expectation values, equation (|28|), 
can give significantly better approximation to Gaussian expectation values than the 
canonical ensemble. On the other hand, for the simple case g{x) = x, everything 
needed in the PSP-approximation formula is given by expectation values in the usual 
canonical ensemble. Using the PSP-approximation would thus enable the approximation 
of Gaussian expectation values without having to compute anything but canonical 
expectation values — this might be especially useful for small samples of macroscopic 
matter for which the canonical expectation values are computable, but non-accurate. 
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Appendix A. Proof of the Gaussian approximation theorem 

We shall begin the proof of theorem |1| from equation (p!7| ) which was shown in section 
[4.2| to follow from the assumptions ^T| and P^ . In other words, we inspect the integral 
representation 

Tt{AFj,{H))= T"^'"— ei'^'"''F(w;)Tr(le-'"»(-^)), (A.l) 
i/3-ioo 27ri 
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which is vahd for all /3_ < /3 < /3+ and A > with an integrand analytic in the strip 
/9_ < Reiu < The proof will depend on the properties of the real saddle point 
approximation in the large A limit so we shall now first examine the large A dependence 
of the real saddle point value. 

By differentiation of the logarithm of the integrand we arrive at the saddle point 
equation 

K^w + a{w)-g{w) = Q, (A.2) 

where the functions a and g are defined in ([131) . A differentiation of the left hand side 
yields then A^ + h'^{w) + which is clearly strictly positive for real w. Thus the 

restriction of the left hand side to the real axis is strictly increasing and we have a real 
saddle point [3 on the interval /5_ < /3 < /3+ if and only if 

A2/?_ + a(/?_) < g{P_) and + a(/5+) > g{p+). (A.3) 

Since /3+ > 0, the second inequality is satisfied as soon as A becomes large enough. If 
< 0, then the same holds for the first inequality as well, but if /5_ =0, then we need 
to have a(0) < g{Q) which is equivalent to the condition |G1| of section ^l2 . 



Let us now assume that either of these conditions hold and let Ac > be the 
infimum of values for which ( |A.3| ) is valid. Then for all A > A^ there is a unique 



real saddle point which we denote by /5(A). As the left hand side of ( |A.2|) is strictly 
increasing, we can also deduce that /?(A) never changes sign: 

(i) if a(0) < ^(0), then /5(A) > 0, 

(ii) if a(0) = ^(0), then /?(A) = 0, 

(iii) if a(0) > ^(0), then /3(A) < 0, 

for all A > Ac. 

An application of the implicit function theorem to the defining equation (|A.2| ) 



proves that /3(A) is a smooth function from (Ac, oo) to (/3_, /3+). A differentiation of the 
defining equation then shows that 

from which we can deduce that (3 is either monotonely decreasing positive (a(0) < (^(0)), 
zero (a(0) = g{Q)) or monotonely increasing negative (a(0) > ^(0)) function of A. In all 
three cases, the limit /3oo = hniA^oo /5(A) exists and belongs to the interval (/3_,/3+) if 
/?_ < 0, and to the interval [0,/3+) if /3_ = 0. 

We will next show that /3oo = 0. Assume, to get a contradiction, that /3oo 7^ 0. 
By the above results, then /3oo always belongs to the interval (/3_,/3+) and thus both 
limA^oo ^7(/5(A)) and limA^oo a(/5(A)) exist and are finite. But taking the limit A oo 
in the defining equation ( |A.2| ) then shows that we must have /3(A) — > 0, which is a 



contradiction. 

Thus we have shown that the saddle point solution goes to zero smoothly and 
monotonely as the scale A is increased. Since we are only interested in the large scale 
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behaviour, we shall next expand In F in a neighbourhood of origin; this is possible since 
F is analytic and -F(O) 7^ 0. The expansion yields 

F{w) = F(0)e^"'+^^'"''(1 + AF(«;)), (A.4) 
where L = a(0), 6^ = A^(0) and the correction term AF satisfies 

hm \—- < 00, (A. 5) 

for k = 3. 

Let us then consider the expansion ([A.4|) when L and are not necessary equal 
to the default values a(0) and 6^(0). It is easy to deduce that if 7^ ^^(0), ( |A.5| ) is 
true only for k <2 and if we have L 7^ a(0), then (|A.5|) holds only for k < 1. However, 
as is evident from the definition (p!OD, < -F(Rew) always, and we get from the 

definition ( |A.4D a bound 

1 2x2 F(3) 

|AF(/5 + ia)|e-2-^ <1 + , (A.6) 

which is valid for all and all real a, L and A^. 

For the proof of the theorem, we need to consider the difference 

Ag = TriAF^m - F(0) Tr [aG^^^{L - g{H)) 

= r""'" — Tr(le-"'»(^))F(0)e^"'+^(^'+^')"''AF(M;), (A.7) 

J/3-ioo 27ri 

where we have used the (yf-transform of the Gaussian trace, the integral representation 
( |A.1| ), and defined the function AF{w) by equation ( |A.4| ). For each A > 0, let us 
parametrize the integration variable in (|A.7| ) as w = (3 + ia. To prove the convergence 
properties stated in the theorem, we need to choose /3(A) so that it decays like a negative 
power of A: we assume that c > and that f3 has been chosen so that 

P^^'^A stays bounded in the limit A — > 00. (A. 8) 

In particular, this means that /5 — > when A — > 00. 

The absolute value of the correction term then has an upper bound 

\Ag\ < F(0)e^^+i^'("'+^') Tr lAe-^^^^^] H — e-i(^'+^')">F(/? + ia)| (A.9) 

L J J -00 2vr 

and we shall inspect the behaviour of the remaining integral next. 

Let k be such that ( |A.5| ) is true; as was mentioned earlier, we can always choose at 

least any k <1. Then there are constants M > and m > such that for all \w\ < 2m, 

\AF{w)\ < M|u;|^ 
Assume also that A is so large that \(3\ < m. Then for all |a| < m, 

A^|AF(/5 + ia)| < MI/3ATTS +i(aA)A-^|'= < M{\f3\A^^ + |a|A)^ 
which implies that 

ATT^V27r(A2 + A2) r ^e-i(^'+^')"VF(/3 + ia)| 
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stays bounded in the limit A — oo. In fact, the same is then true also when the 
integration limits are replaced by ±00, since the remaining integral over the values 
|a| > m does not contribute at all to the limit A — cx); to see this, apply dominated 
convergence theorem to the remainder. 
By equation ( [A .91 ) this proves that 



< ^ ^ 0(A~— (A.IO) 

F{Q)Ti[AG^^{L-g{H))] Tr{Aexp[- {L-g{H)Y]} 

where k and c are any values allowed by (|A.5|) and (|A.8| ), respectively. In the final part 
of the proof we derive conditions when the remaining multiplicative term stays bounded 
for large A. 

Define first an auxiliary variable x by the equation x = L + f3{\^ + A^). Applying 
this in ([A.IOD , reveals that the multiplicative term is the inverse of 

^ 1 (A.ll) 

Since (|A.11|) is always less than one, it cannot improve the convergence of the right 
hand side in ([A.10|) . Thus it is enough to show that it does not spoil the convergence 
for the optimal choice L = a(0). 

Since A is positive, Jensen's inequality can be applied in ( |A.11| ) and we obtain that 
the logarithm of ( |A.11|) is always greater than or equal to 

1 TT[Ae-^3(S\x - giH)Y] _ {x - m? ^HP) . , . ,1 

2(A2 + A2) Tr[le-/39(^)] 2(A2 + A2) 2(A2 + A2)" ^ " ' 

Both of the terms in ([A.12|) are negative and thus they have to be bounded separately 
for the boundedness of the multiplicative term in (|A.1CI|) . 
The first term is small only if (3 satisfies 



\L + (3{X' + A') - gm < VWTT^. (A.13) 

If we multiply this by jS^'^'^'^ and take the limit A ^ 00, it becomes evident that the 
conditions (|A.13|) and (|A.8|) can be compatible only if (3^^'^'^g{[3) stays bounded in the 
limit /3 — >^ 0, i.e. only if |G2| holds with the same value of c. 

The converse is also true in the sense that if c > is such that |G2| holds, then we 
can find /?(A) which satisfies both ( |A.13|) and ( |A.8| ). Such a /5 is defined by the equation 

L + /?(A2 + A2)-^(/5) = 0, (A.14) 

as we shall now show. A brief refiection shows that for a Gaussian fiuctuation spectrum 
F{H) = G,jj2:g^{L - g{H)), we would have a{w) = L + w{\^ + A^) and thus ( [70^ is 
the saddle point equation for a Gaussian fiuctuation spectrum. By the results proven in 
the beginning of this section, we then know that if /5_ < or L < g{0), ( |A.14D defines 
for all large enough A a smooth function /5(A) which monotonely approaches zero as A 
increases. Such /5(A) trivially satisfies (|A.13|) and by multiplying (|A.14|) by /J^'''^^ we 
can see that |G2| implies then that also (|A.8| ) is true for the same value of c. 
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Since it is part of the assumptions so far that either /3_ < or a(0) < (^(O), we 
have thus shown that if holds, then — at least for the optimal choice L = a(0) — it is 
possible to define /3(A) by ( |A.14|) and this function satisfies the decay condition ( |A.8|) . 
Also note that if /?_ < 0, then g{0) < oo and |G2| is trivially satisfied for the best allowed 
choice c = 0. 

For the boundedness of the second term in ( A.12| ), we shall need either /5_ < or 
assume the last of the three conditions, |G^. If /3_ < 0, then cr^(O) < oo and the second 
term vanishes in the limit A oo. If = and L < g{0), we can define f3 by equation 



( |A.14| ) when 



A2 + A2 gil3)-L' 

and clearly it is then sufficient that (3a'^{(3) /g{(3) stays bounded in the limit (3 0+. 
Note that this proves the sufficiency of condition |G3| for the relevant case L = a(0). 

Combining the results we get then from equations (|A.7| ) and ( |A.10 ) that if either 
/?_ < or pTHG3| hold, then for L = a(0) and = ^^(o) we have 



Tt{AFa{H)) = F(0) Tr 



AG, 



{L-g{H)) 1 + 0(A-T+ 



VA2+A2 1 

with A; = 3. We have also seen that changing A would allow using only k = 2 and 
changing L only A; = 1 or nothing at all — if /3_ = and L > g{0), there are no solutions 
to equation ( |A.14| ) and the logarithmic bound we have in ( |A.12[ ) becomes non-conclusive. 
This gives a precise meaning for the statement at the end of the theorem and completes 
the proof. 



Appendix B. Derivation of the bounds for the PSP-approximation of a 
Gaussian trace 



In this section we first derive the "saddle point approximation" (^) for the Gaussian 
trace and then show how the two bounds for the accuracy of this approximation, (p3D and 



(P5|), can be obtained. As in section [4.3| , let L and A be some parameters which define 
a Gaussian fiuctuation spectrum. Then /5+ = oo and we shall inspect the "canonical" 
approximation of the Gaussian trace for any /?>/?_. Let us also define R = \/a{(3). 
By equation (|T^ we can then use the integral representation. 



Tr 



AG,^{L-g{H)) 



— e"2^ " TrMe 
27r ^ 



-f3g{H)^ia{L+f3\^-g(H))^ 



(B.l) 
0, where 



The PSP-approximation is now derived by expanding ln/(/5 -|- ia) around a 
f{w) = Tr(Ae-"'f(^)). 

f{w) is an analytic function in the region Reif > /3_ whose restriction to the line 
Imw = is a strictly positive function. Although the logarithm of / is possibly not 
well-defined in the whole region, the following representation is nevertheless valid for all 
(3 > (3- and a G M, 

"1 



/(/3 + ia) = /(/3)e 



-iag(P) 



exp 



-a 



dt(l -t)a^(/3 + iat) 



(B.2) 
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where —g{w) and ct^(w), as defined by (|13|), are the first and second derivative of In /. If 
there is a singularity on the integration contour in ( |B.2| ), the integral has to be evaluated 
by an infinitesimal deformation of the contour. 

Since o"^ is analytic and cr^(/5) > 0, we can for all < r < 1 find a p > such that 



am 



< r, for all — p < a < p. 



(B.3) 



Let then |a| < p and define z hj z = ^{r - 1)(t^{(3) + J^dt{l - t)cr^{(3 + iat). By ( |R3|) , 
now Kez >0 and \z\ < ra'^{j3). But since for every complex w with Rew > it is true 
that |e~'^ — 1| < we have also the inequality 



1 

e-2' 



< 




+ 









_^ 1 
e 2 



raV2{/3) 



A comparison with ( p.2| ) then shows that we have proven the following bound for the 
quadratic approximation of /, 



/(/3 + i«) 



/(/5)e- 



-iag{/3)-ia2<72(/3) 



1 



< ^raV(/3)exp (|raV2(/5)) 



(B.4) 



valid for all \a\ < p. Since always + ia)! < it is also obvious that for all real 

a the left hand side is bounded by 



exp + 1. 



(B.5) 



Applying the quadratic expansion of / to (p.l|) we get 



Tr 



AG>^{L-g{H)) 

X (1 + Apspa), 
where ApspA is defined by 

By (PJ) and then 



(B.6) 



°° da _i 
e 2 



A2a2+i«(L+/3A2) 



27r 



m 



-iag(/3)-iaV2(/3) 



G^j^{L + pX^-g)\ApsPA\ 

3 2 r da 2 -i 
< -ra / — a e 2 



a2(A2+(l-r)cr2) 



< 



/27r 



p 

3 r 
L2a 



27r 



da ( 
2^ 



\Q 2 



Q^A^ 



e 2 



3 _i 

"2 e 2 

+ 



p2A2 



A 



+ 



e 2 



p2(A2+a2) 



TFT 



(B.7) 



where we have extended the first integral over the whole real line and used 
-'^^^ — -\/27r and approximated the second integral as in 



J dxx^ exp[— 



da _i^2;^2 



|a|>P 



27r 



-e 2^ 



2 / ^^-\^y+p?>^' < e-l^'^' 
27r 



d-U 

— e 2f 
27r 



iA2 



The second bound (|2^) follows now easily from 
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exp 



The first bounds, inequalities (0), are a straightforward consequence of the result 
- [L + p\'-m? _ 1 < Tr{Iexp[-^p^(L-^(g))^]} ^ ^ 



which was derived by using Jensen's inequality near equation (|A.11|) in [Appendix A 
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